Cyberbullying Sentiment Analysis with Word2Vec and One-Against-All Support Vector Machine

نویسندگان

چکیده

Depression and social anxiety are the two main negative impacts of cyberbullying. Unfortunately, a survey conducted by UNICEF on 3rd September 2019 showed that 1 in 3 young people 30 countries had been victims Sentiment analysis research will be to detect comment contains Dataset cyberbullying is obtained from Kaggle website, named, Toxic Comment Classification Challenge. The pre-processing process consists 4 stages, namely generalization (convert text into lowercase remove punctuation), tokenization, stop words removal, lemmatization. Word Embedding used conduct sentiment implementing Word2Vec. After that, One-Against-All (OAA) method with Support Vector Machine (SVM) model make predictions form multi labelling. SVM go through hyperparameter tuning using Randomized Search CV. Then, evaluation carried out Micro Averaged F1 Score assess prediction accuracy Hamming Loss numbers pairs sample label incorrectly classified. Implementation result Word2Vec OAA provide best for data undergoing generalization, lemmatization which stored 100 features model. percentage produced tuned 83.40% 15.13% respectively.
 
 Index Terms— Analysis; Embedding; Word2Vec; One-Against-All; Machine; Challenge; Multi Labelling

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

SIFT, BoW Architecture and one-against-all Support Vector Machine

Abstract. For this first participation to ImageClef Plant Identification, we build on the reference Bag-of-Word framework (BoW). We extract Points-of-Interest (PoI) using the SIFT detector in every image and describe each local feature with the SIFT descriptor. The visual dictionary is built with a K-means algorithm of 100 clusters on the local features. Each image is then represented by its hi...

متن کامل

Forecasting Stock Price Movements Based on Opinion Mining and Sentiment Analysis: An Application of Support Vector Machine and Twitter Data

Today, social networks are fast and dynamic communication intermediaries that are a vital business tool. This study aims at examining the views of those involved with Facebook stocks so that we can summarize their views to predict the general behavior of this stock and collectively consider possible Facebook stock price movements, and create a more accurate pattern compared to previous patterns...

متن کامل

One-Class Support Vector Machine for Sentiment Analysis of Movie Review Documents

Sentiment analysis means to classify a given review document into positive or negative polar document. Sentiment analysis research has been increased tremendously in recent times due to its large number of applications in the industry and academia. Sentiment analysis models can be used to determine the opinion of the user towards any entity or product. E-commerce companies can use sentiment ana...

متن کامل

Fuzzy-Input Fuzzy-Output One-Against-All Support Vector Machines

We present a novel approach for Fuzzy-Input Fuzzy-Output classification. One-Against-All Support Vector Machines are adapted to deal with the fuzzy memberships encoded in fuzzy labels, and to also give fuzzy classification answers. The mathematical background for the modifications is given. In a benchmark application, the recognition of emotions in human speech, the accuracy of our F-SVM approa...

متن کامل

One-against-all multicategory classification via discrete support vector machines

Discrete support vector machines (DSVM), recently proposed in [l01 and [ l l ] for binary classification problems, have been shown to outperform other competing approaches on well-known benchmark datasets. Here we address their extension to multicategory classification, by developing a one-against-all framework in which a set of binary discrimination problems are solved by means of DSVM. Comput...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: International Journal of New Media Technology

سال: 2021

ISSN: ['2355-0082', '2581-1851']

DOI: https://doi.org/10.31937/ijnmt.v8i1.2047